Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Effect of Ghost Character Theory on Arabic Script Based Languages Character Recognition

Identifieur interne : 003703 ( Main/Exploration ); précédent : 003702; suivant : 003704

Effect of Ghost Character Theory on Arabic Script Based Languages Character Recognition

Auteurs : Mohamed Imran Razzak [Pakistan] ; Abdel Belaïd [France] ; Syed Afaq Hussain [Pakistan]

Source :

RBID : Hal:inria-00579666

Abstract

Arabic script is used by more than 1/4th population of the world in the form of different languages like Arabic, Persian, Urdu, Sindhi, Pashto etc but each language have its own words meaning. The set of شhas 58 alphabets. Arabic script based languages character recognition is difficult task due to complexities involved in this script not exist in other script. The analysis of the Arabic script is very complicated due to its use of diacritical marks associated with each character and written in many fonts and style. This script has gain very less intention by the researcher. This paper present a novel technique named Ghost Character Recognition Theory that will helps to develop a Multilanguage character recognition system for Arabic script based languages based on Ghost Character Theory. The main benefit of proposed approach is that it will works for all Arabic script based languages by doing effort for ghost character (basic skeleton) and developing dictionary for every language. By handling all Arabic script based languages many issues will arise like recognition rate as compared to system for specific languages, but in general it is not big issue for multilingual system and at the end we will get multilingual character recognition system.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Effect of Ghost Character Theory on Arabic Script Based Languages Character Recognition</title>
<author>
<name sortKey="Razzak, Mohamed Imran" sort="Razzak, Mohamed Imran" uniqKey="Razzak M" first="Mohamed Imran" last="Razzak">Mohamed Imran Razzak</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-200534" status="VALID">
<orgName>Department of Computer Science [IIU - Islamabad]</orgName>
<desc>
<address>
<addrLine>Sector H-10, Islamabad</addrLine>
<country key="PK"></country>
</address>
<ref type="url">http://www.iiu.edu.pk/</ref>
</desc>
<listRelation>
<relation active="#struct-311524" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-311524" type="direct">
<org type="institution" xml:id="struct-311524" status="INCOMING">
<orgName>International Islamic University</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pakistan</country>
</affiliation>
</author>
<author>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-2362" status="OLD">
<orgName>READ</orgName>
<orgName type="acronym">READ</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Institut national polytechnique de Lorraine</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Hussain, Syed Afaq" sort="Hussain, Syed Afaq" uniqKey="Hussain S" first="Syed Afaq" last="Hussain">Syed Afaq Hussain</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-200535" status="VALID">
<orgName>Department of Computer Science [AU - Islamabad]</orgName>
<desc>
<address>
<addrLine>Service Road E-9 Islamabad 44000 Pakistan</addrLine>
<country key="PK"></country>
</address>
<ref type="url">http://www.au.edu.pk/</ref>
</desc>
<listRelation>
<relation active="#struct-349222" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-349222" type="direct">
<org type="institution" xml:id="struct-349222" status="INCOMING">
<orgName>Air University - Islamabad</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pakistan</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00579666</idno>
<idno type="halId">inria-00579666</idno>
<idno type="halUri">https://hal.inria.fr/inria-00579666</idno>
<idno type="url">https://hal.inria.fr/inria-00579666</idno>
<date when="2009-02-02">2009-02-02</date>
<idno type="wicri:Area/Hal/Corpus">001E42</idno>
<idno type="wicri:Area/Hal/Curation">001E42</idno>
<idno type="wicri:Area/Hal/Checkpoint">002D78</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">002D78</idno>
<idno type="wicri:Area/Main/Merge">003789</idno>
<idno type="wicri:Area/Main/Curation">003703</idno>
<idno type="wicri:Area/Main/Exploration">003703</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Effect of Ghost Character Theory on Arabic Script Based Languages Character Recognition</title>
<author>
<name sortKey="Razzak, Mohamed Imran" sort="Razzak, Mohamed Imran" uniqKey="Razzak M" first="Mohamed Imran" last="Razzak">Mohamed Imran Razzak</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-200534" status="VALID">
<orgName>Department of Computer Science [IIU - Islamabad]</orgName>
<desc>
<address>
<addrLine>Sector H-10, Islamabad</addrLine>
<country key="PK"></country>
</address>
<ref type="url">http://www.iiu.edu.pk/</ref>
</desc>
<listRelation>
<relation active="#struct-311524" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-311524" type="direct">
<org type="institution" xml:id="struct-311524" status="INCOMING">
<orgName>International Islamic University</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pakistan</country>
</affiliation>
</author>
<author>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-2362" status="OLD">
<orgName>READ</orgName>
<orgName type="acronym">READ</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Institut national polytechnique de Lorraine</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Hussain, Syed Afaq" sort="Hussain, Syed Afaq" uniqKey="Hussain S" first="Syed Afaq" last="Hussain">Syed Afaq Hussain</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-200535" status="VALID">
<orgName>Department of Computer Science [AU - Islamabad]</orgName>
<desc>
<address>
<addrLine>Service Road E-9 Islamabad 44000 Pakistan</addrLine>
<country key="PK"></country>
</address>
<ref type="url">http://www.au.edu.pk/</ref>
</desc>
<listRelation>
<relation active="#struct-349222" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-349222" type="direct">
<org type="institution" xml:id="struct-349222" status="INCOMING">
<orgName>Air University - Islamabad</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pakistan</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Arabic script is used by more than 1/4th population of the world in the form of different languages like Arabic, Persian, Urdu, Sindhi, Pashto etc but each language have its own words meaning. The set of شhas 58 alphabets. Arabic script based languages character recognition is difficult task due to complexities involved in this script not exist in other script. The analysis of the Arabic script is very complicated due to its use of diacritical marks associated with each character and written in many fonts and style. This script has gain very less intention by the researcher. This paper present a novel technique named Ghost Character Recognition Theory that will helps to develop a Multilanguage character recognition system for Arabic script based languages based on Ghost Character Theory. The main benefit of proposed approach is that it will works for all Arabic script based languages by doing effort for ghost character (basic skeleton) and developing dictionary for every language. By handling all Arabic script based languages many issues will arise like recognition rate as compared to system for specific languages, but in general it is not big issue for multilingual system and at the end we will get multilingual character recognition system.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Pakistan</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Nancy</li>
</settlement>
<orgName>
<li>Institut national polytechnique de Lorraine</li>
<li>Université Nancy 2</li>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="Pakistan">
<noRegion>
<name sortKey="Razzak, Mohamed Imran" sort="Razzak, Mohamed Imran" uniqKey="Razzak M" first="Mohamed Imran" last="Razzak">Mohamed Imran Razzak</name>
</noRegion>
<name sortKey="Hussain, Syed Afaq" sort="Hussain, Syed Afaq" uniqKey="Hussain S" first="Syed Afaq" last="Hussain">Syed Afaq Hussain</name>
</country>
<country name="France">
<region name="Grand Est">
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003703 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003703 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:inria-00579666
   |texte=   Effect of Ghost Character Theory on Arabic Script Based Languages Character Recognition
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022